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ABSTRACT 



A new method for establishing lower bounds on the 
number of multiplications and divisions required to compute 
rational functions is described. The method is based on 
combining two known methods, dimensionality and rate of 
growth. The method is applied to several problems and 
new lower bounds are obtained. 
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Combining Dimensionality and Rate of Growth Arguments for 
Establishing Lower Bounds on the number of Multiplications 

!• Introduction 

In this paper we describe a new method for establishing 

lower bounds on the number of multiplications and divisions 

required to compute rational functions. If F is a field and 

a, ....ta are algebraically independent (indeterminates) over 
1' n 

F then, as usual, F (a.^ . . . ,a n ) is the field of the rational 
functions of a ir ...,a over F. For various F and H c F we 
shall consider algorithms over (F (a 1 , . . . , a n ) , HU{a^, . . . ,a n > ) , 
namely algorithms computing elements of F(a^,...,a n ) by 
applying chains of rational operations to elements of 

H U{a lf . . . ,a n }. 

The problem of establishing lower bounds on the number 
of operations was tackled first using dimensionality arguments 
Namely, describing the method informally, it was proved that 
algorithms computing certain sets of rational functions had 
to use at least one multiplication or division to introduce 
one or several out of many of the indeterminates appearing 
in the functions being computed. Using these arguments no 
lower bound greater than the number of the indeterminates 
appearing in the functions can be established. These methods 
were used first for polynomials by Ostrowski [4] and Pan [5] 
and later by Kinograd [9] and [10] for sets of rational 



functions which are linear in some of the indeterminate s (note 
that multivariate polynomials are linear in their coefficients) 
Finally, Strassen [7] used these arguments for sets of arbi- 
trary rational functions. The methods were mostly based on 
dimensionality arguments in linear algebra. A typical result, 

due to Pan, states that at least n multiplications and 

n 
divisions are required to compute Z a. x 1 over a suitable 

i=l 
domain. 

Another approach, due to Strassen [8]^ used algebraic 

geometry to establish rate of growth arguments. This approach 

allowed Strassen to prove among other things that the 

n 
computation of the set {Z a?|l < j < n} requires at least 

1=1 

n log- — multiplications and divisions. 

For the case of the computation of bilinear forms 
special methods for establishing lower bounds were developed 
by, among others, Hopcroft and Musinski [3] and Brockett and 
Dobkin [2]. 

Our method will allow us, in certain cases, to combine 

dimensionality and rate of growth arguments, so as to find 

lower bounds which could not be obtained using the previous 

methods. Among other results we obtain lower bounds on the 

number of multiplications required to compute a. x, a 2 x^ , . . . , a R x 

or the polynomial £ a.x a(l) for < a(l) <...< a(n). 

i=l X 



n 



2. Previous results and other preliminaries 

Definition 1 . Let E be a field and let D <= e. An 
algorithm of length N over (E,D) is a sequence of instructions 

A of length N, each instruction of one of the following 
types : 

(1) A [i] : = input d; 

(2) A [i] : = A.[j] u Atk] 

where i,j,k € {1,...,N}, d € D , w € {+,-,x, : } and for 
i,j,k appearing in (2) j,k < i. We shall sometimes say in 
case (2) that instruction A [i] (with a label i) refers to 
instructions A[j] and A [k] • 

With every A a partial function Eval (A) : (1,...,N)— 1 
is associated and defined inductively by 

If A[i] : = input d; then Eval (A) [i] = d. 

If A [i] : = A[j].u A[k] ; then if Eval (A)[j], 
Eval (A) [k] , Eval (A) [j] u Eval (A) [k] ; are defined, 
then Eval (A) [i] = Eval (A) [j ] w Eval (A) [k] . 
In other cases Eval (A) [i] is not defined. From here on 
we shall consider only algorithms A for which Eval (A) 
is total, as there will be no interest in discussing the more 
general case . 

For a subsequence B of A , v;e define 

Comp ( B ) £ { Eval (A) [i] | A[i] c B }. We shall 
also say that B computes { e 1 '«--/ e m ^ (respectively e) if 



{e,,...,e } c Comp ( B ) (respectively e 6 Comp ( B ). 

The definitions above are based on those of Winograd [9] 

Example 1 . To clarify the notions which have been 
introduced above we give a very simple example. Let Q 
denote, as usual, the field of rationals and let x be an 
indeterminate over . Then the following algorithm over 

( Q (x) , Q U {x}) computes 1 + x + x z + x 3 = Z-^i- 

x-1 



l 
1 
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4 
5 
6 
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A 








Eval (A) 


A [1]: 


= 


input 2 ; 




2 


A [2] : 


= 


A [1] : 


A [1]; 


1 


A [3]: 


= 


input x, 




X 


A [4]: 


= 


A [3] x 


A [3], 


x 2 


A [5]: 


= 


A [4] x 


A [4], 


x- 


A [6]: 


= 


A [5] - 


A [2] 


x"-l 


A [7]: 
A [8]: 


= 


A [3] - 
A [6] : 


A [2], 
A [7] 


x-1 

x 4 -l 
x-1 



Let F be an infinite field and let G be an infinite 

subfield of F. Let a, .....a be indeterminates over F. For 

1' ' n 

some sets H such that G ^ H c F we shall be interested 
in studying algorithms over (F(a, , ...,a ), H U {a.,..., a }). 
It will be useful to have a "pool" of parameters. To this 
end we shall have an infinite sequence (c. |q £ i <_ °°) of 



indeterminates over F which does not contain any elements 
from (a.|l < i < n) . Let G H , F denote 6(00,0!,...)/ 
H(c Q ,c lf ...) and F(c ,c 1 ,. . .) respectively. Then any 
algorithm over (F (a 1 , . . . , a ) , H U {a 1 ,...,a n >) can be 
considered to be over (F (a, , . . . ,a ) , H U {a^ , . . . , a n ,cq ,c\. . . } ) 
The purpose of introducing G and Cq,c-j_,... will become 
clear later. To simplify the notation, we shall write * for 
either * or : . Furthermore, we shall sometimes write a for 
either {a,, ,a } or a ,...,a n , c for either {c Q ,c ir . . .} or 

Our purpose is to be able to combine two distinct 
methods for establishing lower bounds. To this end we first 
divide the MDs (multiplications and divisions) into several 

classes . 

Let F c e and G <= d. Then E n F c ^ p and 

G flD/f . We shall characterize MDs in algorithms over 
c 

(E,D) according to vAiether the operands belong to G c , 

F , E-F etc. 
c ' c 



Definition 2 A MD A[i]: = A[i] * A [k] in an 
alaorithm over (E,D) vfoere F c E, G c D will be 



(1) strongly counted (a SCMD) if 

when * = x then Eval (A ) [ j ] € E - F, and Eval (A) (k) € E 

- G or Eval (A ) [ j ] € E - G and 
Eval (A) [k]€ E ~ F c '* 

when * = : then Eval (A) [ j]€ E - F and Eval(A)[k]£ E 

- G or Eval (A) [ j] € E - {0} and Eval (A) [k]6 E 

" F c ; 



(2) weakly counted (a T7CMD) if 

when * = x then Eval (A) [j] , Eval (A) [k] 6 F C ~G C ; 

when * = : then Eval (A) [j] €F c -{q} and Eval ( A) [k] ^ C ~ G C > 



(3) not counted (a NCMD) if 

when * = x then E val ( A) [j] € G or Eval (A) [k] €G ; 
when * = : then Eval (A) [k] 6 G c . 



We shall sometimes refer informally to 
Eval (A)[j] * Eval (A) [k] instead of A [i]: = A [j] * A [k] 
and say that a 1 * a 2 for a i' °2 € E is a SCMD etc * 



Example 2 . Set F = R (the field of reals), 
O = Q , H = U { 4 /2}. Then in the following algorithm 
over ( F( a;L ) ,H U {a 1 )) . 



i 






A 




Eval (A) 


1 


A 


[1]: 


= 


input 4.2; 


4.2 


2 


A 


[2]: 


= 


input /2 


/2 


3 


A 


[3]: 


= 


input a, ; 


a l 


4 


A 


[4]: 


= 


A [1] x A [3] ; 


4.2a 1 


5 


A 


[5]: 


= 


A [2] x A [2] ; 


/r 


6 


A 


[6]: 


= 


A [1]: A [3]; 


4.2/a, 



A [4] is a NCMD, A [5] is a x- 7 CMD and A [6] is a SCMD. 

Every MD is in exactly one of the three classes. As 
every MD can be simulated by SCMDs and ASs (additions and 
subtractions) there is no way of establishing nontrivial 
lower bounds on the number of !*7CMDs or NCMDs appearing in 
arbitrarv algorithms computing certain rational functions. 
We would obviously prefer to give lower bounds on the total 
number of HDs, but we do not have good methods for this. 
Instead, we shall give lower bounds on the number of CMDs 
(counted MDs) which comprise both SCMDs and T-7CMDs. 



We start by citing a well known characterization of 
of algorithms without SCMDs [9]. 

Lemma 1 . Every element computed by an algorithm A over 
<F (a), HU {c,a} which has no SCMDs is of the form 



f + ? g a. f g. € G c , f €F c 
i=l 



(1) 



Proof. 



By induction on the length of A 
[ol Trivial 



N-HJ We only remark that if 

(f 1 + Eg} a.) * (f 2 + lq\ a ± ) is not a SCMD , 
then the result of this KD is of the form (1). n 

In the interest of makincr the exposition as clear as 
possible some of our proofs will not be completely formalized. 
For example, in some proofs, instead of using a formal 
induction on the length of algorithms we shall use more 
informal arguments . 

We quote now (using our formulation) the theorem which 
established the dimensionality arguments [7]. 

Theorem 1 (Strassen) . Let A be an algorithm over 
(F(a 1 ,... ,a ) H U {a ± , . . . , a n > ) computing Y (a) ={i|» i (a) | 1 < i <_ t) 
usinq m < n SCMDs. Then there exists a (Zariski) -dense 
subset S of F n , such that for every s € S there is an 
n x (n-m) matrix T over G of rank n-m, such that 



¥' (b) = {i|>. ( rb + s_) I 1 <_ i £ t} can be computed without 

SCMDs over (F (b x , . . . ,b n _ m ) , H U {b , . . . ^ n _ m H where 

b = (b, ,....b ) is a sequence of indeterminates over F. G 
— 1 n— m 

Although Strassen did not use our classification of 
MDs, his proof can be used for the above theorem. 

Remark 1 . Similarly to Lemma 1, it can be shown that the 

elements of ¥' are of the form 

n-m 
f + E g. b. , g. € G , f € F . G 
i=l 

To show how Theorem 1 is used we present a slightly 
strengthened version of Pan's result. 

Corollary 1 . Let x be an indeterminate over G and let 

F = G(x). Then every algorithm over (G(x ,a) , GU(x,a}) 

which computes I a.x a(l) for < a(l) < ... < a(n) 

i=l x 

requires at least n SCMDs. 

Proof . Assume that there exists A satisfying the assumptions 
of Theorem 1 which has only n-1 SCMDs. Then there exist 

s l' ■ • • ' S n € G (X) and g l' ' * * ' g n € G not a11 ° f them Zer °' 

such that I (g £ bj_ + s ± ) x a(l) = B (x) b ± + a(x) can be 

i=l 
computed without SCMDs. But, on the other hand, it is 

impossible by Remark 1 as &(x) € G(x) - G . G 
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For a set ¥ and an algorithm A computing V ve 
shall denote by y n CF) and y D (A) respectively the lower 
bound on the number of SCMDs which can be obtained by an 
application of Strassen's theorem. Similarly, we shall 
denote by u (?) and p(A) the number of CMDs required 
to compute Y by an algorithm A. 

In manv cases it is simpler to use Winograd's 
theorem [9] which can also be obtained as a corollary 
to Strassen's theorem. 

Corollary 2 (Winograd) . Let A be an algorithm over 
(F(a 1 ,...,a n ) , II U {a 1 ,... f a n }) computing $ a + <{> where 
$ is t x n matrix and <f> a t-vector of elements in F. 
Furthermore, assume that there are m columns in $ such 
that no nontrivial linear combination of them over G 
is in G t (obviously it is in F t ) , Then A has at least 
u (A) = m SCMDs. □ 



II 



3. Main results 
We shall now describe briefly and informally the main 
idea behind our method. We start with an algorithm A over 
(F (a) , H U a) which computes V (a) . This algorithm has 
y(A) CMDs (we want to find a lower bound on y(A)), at least 
u of them SCMDs. V7e transform this A into an alaorithm 
B over (F , H U c) . This algorithm computes certain A and 
¥ (A) and is obtained from A by (among other operations) a 
substitution of A for a which reduces at least y D SCMDs 
into NCMDs. Thus, we shall have y(B) <_ y (A) - y D (A). We 
can now use rate of growth arguments to give a lower bound, 
say u r (B), on the number of CMDs in B. Thus, y(B) >_ U Q (B) 
and y(A) _> V D (A) + U G ( B )- '- e shall show that for some 
¥(a) there are nontrivial lower bounds on U G ( B ) which do not 
depend on a particular B. Thus, we can establish a lower 
bound on y (¥). 

The following is our main theorem: 
Theorem ?. Let A be an algorithm over (F (a^, . . . ,a ) , 
H U {a 1 ,..., a. }^ commuting ¥ (a) = {^(a)|l < j < t}. Then 

there exists an al - orithm B over (F(c 1 , ' c n ^' HUlc^...,^}) 

computing u= (u ± (c 1 , . - . ,0.^) | 1 £ i < n) , v = (v i (c ir . . . ,c i _ 1 ) 
|1 < i < n) ,A = (A ± |l < i < n) and Y (A) = {^ (A) 1 1 < j < t} 
such that 
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(1) v (B) < p(A) - U D (A) , 



(2) Comp (B) c: r, (f? U {a,...,a }) (the set of rational functions 

i lf ...,a n ) over G^ 



c 
in elements of H U {a 1f ...,a n } over G n ) , 

n 



(3) A. = u. + c. v. + Z w?(u. + c. v.) 

1 x x x j= i+ l L 3 3 3 



i 



(4) u i# v^^ € F (c ][/ . . . ,c ± . x ) , w? € G(c lf ... ,c. 2 ) , 

(5) A is a permutation of .A 

(6) If there were no SCDs (strongly counted divisions) in 
A, then v. = 1, i = l,...,n, 

(7) If there were no CDs (counted divisions) in A, then 

Conp (B) £ G [H U {a ,'...', a }] (the set of polynomials in 

elements of H U {a,,..., a } over G ). 

x n c 

Before proving the theorem we shall prove some results 
which will be useful in its proof. 

If A is an algorithm over (F (a) , H (J {a,c}) without 
fiCMDs, then b^ Lemma 1 every element computed by A is of the 
form (1). T'.e shall show that it is possible to "reorcranize" 
A so as to obtain B without increasing the number of CMDs 
and which computes all the f's appearing in elements of the 
form (1) and then all the other elements computed by A. 
To state the result formallv: 
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Lemma 2 . Let A be an algorithm over (F (a) , !! U (£/£}) 

without SCriDs. Then there exists an algorithm B = B,B 

(concatenation of two seauences of instructions B. and Bp) 
over (F (a), H U (c,a}) such that 

(1) y(A) = y(B) , 

(2) Comp (A) c Comp (B) , 

(3) Comp (Bj) - F c , 

(4) {f |f + Eg i a i G Como(A) } c Comp (E^) • 

Proof . T.-.je note first that by Lemma 1 every element in Comp 
(A) has the form (1). I 7 e shall first construct B 1 , ty 
induction on the length of A. 
RTI B, is empty. 

| n+l\ Let A' consist of the first N instructions of A and 
let B' satisfy the lemma for it. 7a e shall now define B^ 
There are several cases to be considered. 

(1) A[K+1] : = input d; 

If d € F then add this instruction (changing the 
c 

label if necessary) to b| obtaining B^ ; otherwise, 
set B, = B, . 

(2) A[N+1] : = A[j] to A[k] ; 

then 

Eval (A) [j] = f j + ^g{a i , 

Fval (A) [k] = f k + Ig k a i ' 
and f^ ,f k G Comp (B^) . 
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There are several subcases to be considered: 

(a) u £ {+,-}. In this case add the instruction 

computing f J o)f to B' obtaining B, , 

~i i Y k 

(b) w e {*,:} and g^ = ... = q^ = g£ = ... = g R = , 

Then add or A [n+1] to B£ 

obtaining B.. . 

(c) a) € {x, : } and {g^,...,g^, g£,...,g^} 7* t°}« 
Then either g? = . . . = g- 1 = or 

g ,=...= a =0. If, sav a = ... = a = u 

^1 n - 1 n 

then f 1 f ~ G (otherwise this would be a SC.MD) 
c 

i k i 
and add the instruction computing f J wf ' to B. 

obtaining B, . 

r ve saw that in all the three subcases we added the instruction 

i k 
f - wf to B ' , but we treated them separately so the reader 

could convince himself that no CMDs were added. 

Now it is possible to add instructions, which are not 

CMDs to B. which compute all the elements of 

Eval (A) - Eval (B ) thus defining B = B^ . O 

Corollary 3 . Let B = B^ satisfy the conditions of Lemma 

2 and let f £ Comp (B 1 ) , g ' g l''**' g n € G c * Then it iS 
possible to construct B' = B x B ! 5 B 2 over ( F c (a),H U{c,a}) 
such that 

(1) g Q f + Eg i a i € Comp (B 1>5 ) , 

(2) y(B') = y(B) . 
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Proof . B' is obtained from B by (informally) inserting 
between B, and B„ instructions which are not CMDs and 
which compute g Q f + Eg. a. using f which was computed by 

B l ' 

Proof of Theorem 2 . Instead of proving the theorem formally 
by induction on (say) length of A we shall show how to 
transform every A into an appropriate B satisfying the 
theorem. 

We consider the first SCMD A [i] : =A [j ] *A [k ■] . Let 
a, ^ Eval (A)[j] f a 2 = Eval (A) [k] . Then at least one of 
a, and a- , denote it by a , has the form f + Eg^^ 
where 

(1) f f G(H) , 

(2) g;L ,...,g n (EG , 

(3) {g 1 ,...,g n l t to}. 

If both operancfe have this form and satisfy condition (3) 
let a = a-. By (possibly) permuting the vector a and 
using the same symbol, a, to denote the permuted vector, 
we may assume that g, ¥■ . We shall consider two cases: 

(1) If a * a 2 = a i x 2 or a E °2 then consider 
the equation f + Eg. a. = g,c, and define v 1 = 1 . 

(2) If a i * °2 ~ °l :a 2 and aE °l then consider 
the equation f + Eg ± a i = g 1 v 1 c 1 and define v 1 & a 2 (we 
know that a 2 € F - {0}) . 
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Solving the appropriate equation for a 1 and naming 

n 1 
the solution A 1 we have A x = u ± + c^ + I g 2 a ± where 

u l = _f/g l g i = " g i /g l * 

Denote by A the algorithm obtained from A by 

taking all the instruction up to, and not including, the first 
SCMD, This algorithm obviously has no SCMDs and therefore 
using Lemma 2 and Corollary 3 we may transform A into 
B n B n C B, where B. and B_ satisfy the conditions of 
Lemma 2 and B x 5 computes u 1 ,v 1 and A 1 . Thus we may 
assume that A is already in the form B 1 B 1>5 B 2 ' and A 
is an algorithm over (F(c 1 ,a) , F U {c-^a}) . 

We construct now h 1 from A by (informally) sub- 
stituting A 1 for a x . More formally, we drop instructions 
with input a, and each instruction which refered to one of 
them will refer to the instruction which computed h-^ . Thus 
A, is an algorithm over (F (c 1 ,a 2 , . . . ,a n ) , H U {c ir a 2 , . . . #a n >) 
We note first that Eval (A^ is total, namely there are no 
attempts in h ± to divide by zero. To show this it is 
enough to show that if ty (a 1 ,a 2 , . . . ,a n ) € F ia^i^ ,... ,a n ) - {0} 
then ij;(A 1 ,a 2 ,...,a n ) € F (c^ , . . . ,a n > - {0} . But this 
follows immediately because: 

(1) if ^(a ir a 2 ,.. . ,a n ) is not a function of a x 
then iMa^a^ — » a n ) = ^(A^a^ — ,a n ) , 

(2) if ^(a 1 ,a 2 ,...,a n ) is a nontrivial function of 
a x then i|> (A^a^ . . . ,a n ) is a nontrivial function of c^ . 
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We also see that y (A.^ < y(A) - 1 . First we note 
that the instruction in A. which corresponds in an obvious 
way to the first SCMD in A is a NCMD. Indeed there are two 
cases corresponding to the two equations above: 

(1) in this case the SCMD is transformed into 
a,*g,c 1 or g^c.*^, 

(2) in this case the SCMD is transformed into g 1 c 1 
which can always be computed without CMDS. 

In addition we note that any operation in A which was 
not a CMD is transformed into an operation in IK^ which is 
not a CMD. Thus the number of CMDs has been reduced by at 
least one. To show this we only remark that if 
if)(a 1 ,a 2 ,. .. ,a n ) 6 G then ip (A x ,a 2 , . . . ,a n ) <G G. 

Finally, we summarize for the reader that A.^ computes 

u l' v l' A l' ? ( A 1 ' a 2'*-* ' a n J and ^ (A i } £M A > - 1 • 

Let us now consider the first SCMD in A 1 . Once again 

we consider a i*°2 and at least one of a i and a 2 ' 

n 
denote it by a , has the form f + Z g.a. where 
J 2 x x 

(1) f 6 G(H U {c ± }) , 

(2) g 2 '* , *' g n € G(c i ) ' 

(3) {g 2 ,...,g n } ^{0}. 

If both operands satisfy these conditions, pick the second 
one. Similarly to above we consider an appropriate equation 
which would reduce this SCMD into an operation which is not 



a CMD and construct an algorithm A over 

(F(c 1 ,c 2 ,a 3f — ,a n ) , H U {c lf c 2 ,a 3 ,...,a }) 

which computes 

n 1 
U 1' V 1' A 1 " u l + c l v i + Z ?i a i ' 

u 2 ,v 2 ,A 2 = u 2 + c 2 v 2 + | g^a. 
and ¥(A lf A 2 ,a 3 ,.. . f a ) 

where u 2 ,v 2 € Ffc^ (actually u 2 ,v 2 € G(HU { C;l }) ), 
9 L e Gic^ and y(A 2 ) < y^) - 1 < y(A) - 2 . 

We proceed in this manner m £ n times, until there 

are no SCMDs left. Then A computes u ir ...,u , 

m c 1' ' m ' 

V l V m ' A l A m and ¥(A 1 A m ' a m+l a n ) 

OVer (F(C 1 c m ' a m + l a n ) ' H {c l C m ' 

a m+l' * * * ,a n^ * A l'***' A m can i3e wr "i tten i n terms of 
a , , . . . ,a as following: 

n-m 
A. = s. + T y • a (2) 

where s ± € F(c 1 ,...,c m ) , y if j fGlc^...,^). Setting 
now b 1 s a m+1 , . . . ,b n _ m = a n we may say that A is an 
algorithm over (F c (^ , . . . ,b n _ m ) , H c U {1^, . . . ,b n _ m > ) 
which computes * ' (b) = ¥ (s + Tb) where r = (y ■ • ) is 
defined by 
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v. . from equation (2) 



Y i,j 



1 < i < m 

m < i <_ n , j=i 

m < i < n , j^i 



r is an n x (n-m) matrix of rank n-m and from 
here using Strassen's result it is possible to show that 
m = U D . Thus y(A m ) < y(A) - U D (A) . 
Now we set 

u .=0 v.=l i = m+l,.../n 

g* = i = m+l,...,n j = i+l,...,n 
A . = c . i = m+ 1 , . . . , n . 

Thus we have the following system of equations 

n i ~ 
A. = u. + c.v. + I g. A. i=l,...,n 

1 x x 1 j-i+1 3 3 
where u i ,v i fFlc^...,^). g^ G G(c ir . . . ,c i-]L ) 
and where we introduce a\ instead of A^^ as we are 
actually dealing with a permutation of A i . This system 
being an upper diagonal can be very easily solved so as to 
give the solution claimed by the theorem. The last 
algorithm constructed was /^ and it was over 

(F(c 1 ,...,c n ,a m+1 ,...,a n ), H U {c lf ...,c n ,a m+1 ,...,a n }) 

We substitute in it c ± for a ± , j = m+l,...,n obtaining 
the required algorithm B (we can of course assume that it 
computed u m+1 ,...,u n ,v m+1 ,...,v n , A m+1 ,.. . . ,A n > . Finally 
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it also follows easily that if i|i (a) € F (a) - {0} then 

iJ>(A) € F ~ tO} and there are no attempts in B to divide 

by zero. □ 

Corollary 4 . Under the assumptions of the theorem there 

exists B over (F c , H U c) computing u = (u ± | 1 < i < n) , 

v = (v. |1 < i < n) , A = (Ajl < i < n) and 

Y(A) = { \p . (A) 1 1 < j £t} such that 

(1) p(B) < u(A) - u D (A) , 

(2) Comp (B) 6 G (H U {a,...,a n >) , 

m • 

(3) A. = u. + c x(i) v. + _J +i w^ (u. + c i(j) v.) , 

(4) u., Vi e F(c i(1) ,...,c i(i _ 1) ) , wj € G(c i(i) ,...,c i(i _ 1) ), 

(5) i:{l,...,n) ■*■ U,...,n} is a permutation, 

(6) if there were no SCDs in A then v.^ = 1 , 
i=l, . . . ,n , 

(7) if there were no CDs in A then 
Comp (B) <= G c [H U {a lf ...,a n }] . 

Proof . Follows from Theorem 2 by giving the permutation 
of A explicitly. n 

Corollary 5 . Under the assumptions of Theorem 2 we may 

assume that there exist a = (a i |l < i < n) , 

B - (3. |1 < i < n) s.t. B computes a , 3 , A and V (A) and 

A i(k) = a i(k) + Y x(k) C i(X(k)) 5 x(X(k)) llk - n 
where 
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°i(k) € F(c i(l)"--' c i'(X(k)-X) ) . f 



B i(X(k)) €P(c t(l) c i(X(k)-l) ) " {0} ' 

Y i(k) €G < c t<l)'"-' C i<X<k>-2>> - {6V ' 
k < X(k) < n 



and 



(1) if there were no SCDs in A then 
3 i(X(k)) " X • 

(2) if there were no CDs in A then 
Comp (B) € G C [H U {a x ,..., a n )l. 

Proof . He use the form for A derived in Corollary 4 and discuss 
two cases 

(1) {w£ +1 ,...,wg> + . Let 

X(k) * max{j|w k * ? 0} and set 

•i <k) * «k + <\ (k) v k * x *3Li ** (Uj + ° x (j>Vj> 

+ w k i(k) 
B i(X(k)) * v X(k) 

\{k) W k ' 

(2) m£ +1 ,...,i£> - . Let X<k) • k and set 
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4. Applications 

From now on we assume that F = G(x) , H = G U {x} 
where x is an indeterminate over G . In this case we 
shall also write sometimes explicity 

A x(k) (x ' c i(l)"-"S(Mk)) ) = a x(k) (x ' C i(l) c i(X(k)-l)) } 

+ Y i(k) (c i(l)'--" c x(A(k)-2) )c i(Mk)) 6(x ' C i(l)"*-' C i(X(k)-l) ) 

Furthermore we shall discuss only the case ¥ (a) = <t>a + <|> 
where $ is a t*n matrix and 4> a t-vector of elements 
in F (compare with Corollary 2). In order to be able to 
deal more uniformly with this case we set a Q - c_ and 
write $a + <j>c Q = $a + <j>a Q (thus, say multiplications by 
elements from G(a Q ) = G(c Q ) won't be CMDs ) . Defining 
now a tx(n+l) matrix $ by 



*. . 



$ . . ; otherwise 



C-A. 



we may say that * (a) = $a where a = (a Q ,a^, . . . ,a n ) 

If we set A fl = c- all our results up to this point follow. 

We could have been more formal and replaced everywhere F 

by F(a.) etc., but it would serve no useful purpose. 

For an element p of G [x] (the set of polynomials 

in x over elements of G ) we shall denote by 9 (p) the 

c 

degree (in x) of p . 
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Definition 3 . Let •= ( *i,j ) i=l,...,t; j*0,...,n be 

a matrix in G[x] . Then define Sp($), the sparsness index 

of $ , as following 

Sp($) = ^^ {max {3(». rl(n) ) -3 (*!,,(„-!,) ' 

^•i.Kn-l) 1 - 9($ i,;(n-2) ) 3( *i r i(l) ) 

~ 3($ i i(0) ) ' 8($ i X (o) > l«{O f l f ...,n} - {0,1,. ..,n} 
is a permutation such that 

3( *i,Kn>> i 3 «*i f i(n-l)> ^ ••' i 3($ i,i(0)> 

Example 3 . Let t=2 , n=3 and 

/x 3 + 1 x 4 x\ /x 3 

* = V 1 x= xj *-[,* 
then 

x 3 x 3 +l x 4 x 

* - ' 2 , 5 

X 1 X Xj 

If the first row is rearranged in the order of descending 

A "3 *3 

degrees then it becomes x , x , x +1, x . Similarly for 

5 2 ^52 

the second row: x , x , x, 1 . Thus Sp($) = 3 (x )-3(x )=3 

Theorem 3 . Let A compute $a over (G(x,a) , G U {x,a}) 
Without CDs (thus necessarily * <= G[x]) . Then 
y(A) >_ y D ($a) + [log 2 Sp($)~j provided that either 
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n ,,. 

(1) All $. . are monomials of the form II a, or, 

i,D k-1 k 

(2) For every i the degrees of $. _,...,$. 

X i U l/il 

are distinct. 



Lemma 3 . Under the assumptions of Theorem 3 the rational 
functions A are polynomials in x which do not vanish 
for x=0 . 



Proof . Let <_ k £ n. Then A ... is a polynomial in x of 
the form 



a i (k) {x,c i (1) "••' c l (X(k)-1) ) 



+ Y , /1,\ (C, ,-,.,. ..,C ,, /i,\_oW C 



i (k) tc i (1) "••' c i (X(k)-2) ,c x (X(k) 



If we substitute x=0 into A ,, . we obtain a nontrivial 

i (k) 



polynomial in c 1 /\fi c \\ 



H 



Lemma 4 . Let B compute p € g [x] over (G (x) ,G U {x}) 

1 ' " C O w 

without CDs. Then y(B) >_ flog„3 (p)l 

Proof . Well known. n 

Proof of Theorem 3. By Corollary 4 there exists B which 
computes A, $A , (A = (A_,...,A )), s.t. y (B) <_ y(A)-y (A), 
and it is enough to show that at least one of the (n+l)+t 
polynomials A , $A has the degree which is at least 
Sp($). W.l.o.g. we may assume that 3 (<3? . ) < 3 ($ ■ , ) 

I/O x , x 

< ... < 3($. ) and Sp(i) = max{3($. )-3($. ,),..., 
~~ "" x f n x f n x ^ n "~ x 
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3 (* ■ , ) -" 3 (* • n ) , 3 (* • n ) > • Let < k < n be such that 
Sp ($) = 3($. k )-3(*. k _ 1 ) (for completeness: 



3($. , ) - 0). We can write 
i 1 — -»- 



n 
q= J A.$. . = q. + q_ where 



q. = J A.*. . , q = J A. $. . 



k-1 n 

I A * , q = £ 
j=0 D 1,:] ^ j=k 

and we shall show that at least one of the polynomials (in x) 

q, Aq , A.,,..., A has the degree at least Sp($) . There are 

two cases : 

(1) 3 (q) l_ Sp($) and the result follows. 

(2) 3 (q) < Sp($) . As by Lemma 3 

3(q 2 ) >_ 3(*. #k ) > 3(« irk ) " 3 («!,],.!) > SP($) 

It follows that 3^) >_ 3 (^ k ) . Thus for some j , 
£ j <_ k-1 , 

3(A.) > 3(q,)-3(*. .) > 3(*. .) ~ 3 ($ • k _-, ) > Sp($) 
and the result follows . □ 

Corollary 5 . If under the assumption of Theorem 3 A 

n . . ■> 

computes Z a.x au ' for < a (1) < . . .<a (n) 

i=l x 

then y (A) >^ n + flog? max ^ a ( n ) - a (n-1) , . . . ,a (2) -a(l) 

, a(l)}"l. 

Proof . Immediate as by Corollary 1 y (A) = n . □ 
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Let p(x) € G(x) . Then for every integer e and a 

sufficiently large integer N there is a unique sequence 

(ir . | e <_ j < N) <= G such that 

N 
p (x) = Z it . x- 5 + r (x) 
e J 

where tt. £ G and r(x) = r, (x)/r 2 (x) where r,,r, are 

polynomials satisfying 3 (r, ) - 3 (r„) < e 

Definition 4 . Let a = (a (1) , . . . ,a (m) ) be an increasing 

sequence of integers, and let p_ = (p, (x) , . . . ,p (x) ) be a 

sequence of elements in G(x) . Then p_ is a - normal if and 

only if the determinant det (tt . . . ) 4_-i m -;_t m 

N 
is not zero where p. = E tt"!" x 3 + r, /r., . like above. 

1 j«a(l) 3 L ' X *' X 

lor a sequence a like above we define y (a) - 

min{ CMDs required to compute a sequence f(q,,... f q ) c G(x) 

| (q.,...,q ) is a-normal} . 

Theorem 4 . Let A be an algorithm without SCDs over 
(G(x,a) , G U{x,a}) computing the sequence a. p, (x) , . . . , 
a p (x) where p, (x) , . . . ,p (x) is a sequence of rational 
functions having an a-normal subsequence for some a = 
(a(l) ,. . . ,a(m) ) . Then y(A) >_ V D (&) + V (a) . 

Proof . After applying Corollary 4 (we can assume that 

i(k) = k) we have an algorithm B computing (among others) 

the sequence (AvPi, I 1 1. k £ n ) where 
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Vk = lo k (x ' C l'"" C X(k)-l ) 

+ V c l c X(k)-2 ) C X(k) ] ?k (x) for 1 < k < n. 

W.l.o.g. (p,,...,p ) was a-normal. We can expand 

N _. . _ _ 

A.p. = I tt^cJx 11 + r. . (x)/r, . (x) 

1 x j-a(l) D ~ 1,X l ' 

where tFJ € G(c) , r, . , r„ . € G(x, c) , 

J J-f 1 'r 1 

3 (r, .) - 3 (r „ .) < a(l) . Thus det(?\,J is a rational 

function of c,,...,c and it will suffice to show that it 
1 n 

is a nontrivial function. From the above it follows that 

? j = ir j [c X(i) Y i (c l"--' c X(i)-2 ) + ^i,j (c l"--' C X(l)-l )] 

= *j C X(i) Y i (c l C X(i)-2 ) + lr jC i ,j< c l'---' c X{l)-l ) 

Using the well known formula concerning expansion of a 
determinant the elements of which are sums of pairs we have 

det (*j> - [jj^ c^.jY.f^ C)l(i) . 2 )] det (ttJ) 

+ n(c 1 ,...,c x(n) ) 

and from here by induction on n and analyzing n it is 
possible to show that indeed det (?^) ? . Thus 
(A,p, | 1 <_ k <_ m) is a-normal and the result follows. □ 

A slightly weaker version of the following result 
was stated by Shaw and Traub [6] and used by them to prove 
the optimality of one of their algorithms . 
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Corollary 6 . Let A compute (a, x.. , . . . ,a x ) over 
(G(x,a) , G U {x,a}) under the restrictions of Theorem 4. 
Then y (A) >^ 2n-l . 

Proof. As (a,x,...,a x n ) = $a where $. . = 6. .x- 1 
x n — i»] - L /J 

then by Corollary 2 u (A) = n . The subsequence (x , ...,x ) 
of length n-1 of (x,x , . . . ,x n ) is (2,...,n) normal and 
therefore B computes a (2 ,... ,n) -normal sequence 
A„x , . . . ,A x n . We shall show that if B computes a 
(2 ,... ,n) -normal sequence, say (p..,...,p , ) then 
y(B) >^ n-1 and the result will follow. Assume that only 
I < n-1 CMDs appear in B . Then if y^, . . . ,7^ £ G c (x) 
are the results of these CMDs then 

Pi = J ± 9i,j *j + «j 

where g. . € G and 6. € g [x] and are linear in x 
!»] c 3 c 

(6. were computed without CMDs). Dropping terms linear 
in x we have 

I 



i=l , . . . ,n-l 









Now (p, , ,p ,) is (2, — ,n) - normal and therefore 

linearly independent. Thus it cannot be spanned by a linear 
combination of I < n-1 elements y 1 ,..., Y^ • Thus 
y(B) 21 n_1 and tiie result follows. ^ 

As the next application of our method we sketch 
briefly how we can prove Brodin's result [1] that Horner's 
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rule is uniquely optimal. 

Theorem 5 . (Borodin) Horner's rule it the only algorithm 
which uses n MDs and n-1 ASs (additions and subtractions) 

to compute E a, x 1 over (G(x,a) , G U {x,a}), and 

i=l n i 

thus is the uniquely optimal algorithm computing E a i x . 

i=l 

Proof . (sketch) . It is known that n SCMDs and n-1 ASs 
are required. If we apply our method of substitution to 
Horner's rule (vhich can be described informally as 

i ± x 1 = (...(a n x + a n-1 )x + ... + a x ) x ) 



n 

E a 
i=l 



we note that i (1) = n , . . . , i (n) = 1 , B computes 

n ■ 2 

I A i (x) x 1 = -(c 2 x + c x ) x + ( - c 3 x + c 2 ) x + ...+ 

^ c n X + C n-1 } Xn_1 + C n ^ = C l X and A n = C n ' 

A. = -c.,, x + c. i=l,...,n-l . 
i i+l l 

Let now A be an arbitrary algorithm computing 
La. x 1 which has exactly n CMDs (all of them necessarily 
SCMDs). Thus y(B) < p(A) - n = n-n = and therefore 
EA. (x) x 1 , A x (x) ,..., A n (x) are necessarily linear in 
x . In addition to this using the facts that A had no 
WCMDs or NCMDs and exactly n-1 ASs it is relatively 
easy to show the claimed result. □ 

A special case of the following "decomposition" 
theorem was first proved by Winograd [11] . 
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Theorem 6 . Let A over (F(a) , H U a) compute 
¥,(3.) U ¥ 2 tfiere ^ (a) <= F(a) and * 2 c F . Let 
y(¥~) be (as usual) the minimum number ofCMDs required 
to compute ¥ 2 over (F(a) , H U a) . Then 
y (A) > y D (A) + y(4f 2 ) = VV^ }) + y(H< 2 ). 

Proof . Using Theorem 2 we obtain B computing A , 
^(A) U ^2 . Thus y(* 2 ) < u(B) < u(A) -W D (A) and 
the theorem follows. lD 

We now give an example without making the assumptions 
appearing at the beginning of this section. 

Example 4 . Let A be an algorithm over (R (a) , Q U {p,a}) 

n 2 1/M 

computing p E a. without CDs where p = 2 and 

n i = l 
M > (m+l)2 . We shall show that y(A) >'n + [loq 2 m ] . 

Assume that y(A) < n + j~ lo g 2 m ^ < log 2 M ' Then (compare 

with Lemma 4 ) it is possible to conclude that we may 

think of P as if it were an indeterminate over . Now, 

using essentially the same method which Strassen [7] used 

n 2 
to prove that an algorithm computing I a^ over 

i=l 
(R(a) ,R U a) requires at least n MDs , we can show that 

y (A) = n. The corresponding B computes p ZA i which 

is a polynomial in p over Q . While proving Theorem 2 

we noted that if i|»(a) e F <a.) - {0} then i|» (A) € F Q - {0 } . 

n 2 m 2 

Thus I A. is a nontrivial polynomial in p and p IA^ 

is a polynomial in p of degree m at least. Thus 
y(B) > |"1°99 m l anc * t * ie result follows. 






-m 



: -ss?;- 



*sa . 1 ^ £ ¥ few C*«^ C# t * «»* $ ¥i' UJ^r 
bsaitiEMtt $3© !# %«cuk?<i 'ssMSLtitlii 4^$ {&•£#* a»> act tgf I « 






anoi^fl^i*«s »d* eatfe* j*o«yiK iipm jfe--4*4t twm «w 

it:,,.; ■.:-,-. 

i:; ; H*,t») Ui), t*>*> tw Mfc**)**^* m~m ft JNfcl .jLjisBHSa- 

'* J ~*. MVf > - ; ' * ' & at 

£•:■ r ' 2 *•* * 

>, • .T* s ^oll ♦ «"< (A)c ■**& *Q4t t*m&*4* . £U+«} <S 

,wok ' . tmrp 9$mlm$$®£mi mm mtsm -ti %k m <t to 3te£i£* 

-V •■ - C 0} - 1 5 (A)* a^m (U --'<£» : * %|f ;" *4 ^»4^ fesiaci «v 

c m ■ " * ** '•'-■■; t n 

'.■'■: •.■;,. -".:' ••:' . * '1*1 



33 



5. Conclusions 

We have described a new method for establishing lower 

bounds on the number of multiplications and divisions. 

Although most of the applications were given in a rather 

restricted setting (linear functions of indeterminates were 

considered) the basic theorem was formulated in a gneeral 

framework. Actually some common but unnecessary assumptions 

were made (e.g. it was not necessary to assume that 

a,,...,a were indeterminates, and weaker assumptions would 
1 n 

have been sufficient) . As the rate of growth arguments do 
not handle divisions as easily as multiplications, our 
results were also deficient in this manner. It seems to 
us that at least some of our results can be strengthed by 
studying A and ¥ (A) more carefully. It seems that the 
method, also very useful in some cases, is inherently 
limited to proving lower bounds of the form (n) . We would 
like to mention also that it seems that inherently the same 
method can be used to handle additions and subtractions. 
Unfortunately , rate of growth arguments for additions and 
subtractions are almost nonexistent. 
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